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Recent advances in genome and transcriptome analysis have 
enabled identification of numerous members of a new class of 
noncoding RNA, long noncoding RNA (IncRNA). IncRNAs are 
broadly defined as RNA molecules greater than 200 nt in 
length and lacking an open reading frame. Recent studies 
provide evidence that IncRNAs play central roles in a wide 
range of cellular processes through interaction with key 
component proteins in the gene regulatory system, and that 
alteration of their cell- or tissue-specific expression and/or their 
primary or secondary structures is thought to promote cell 
proliferation, invasion and metastasis. The biological and 
molecular characteristics of the large majority of IncRNAs 
remains unknown, and it is anticipated that improved 
understanding of the roles played by IncRNAs in cancer will 
lead to the development of novel biomarkers and effective 
therapeutic strategies. [BMB Reports 2012; 45(11): 604-611] 



INTRODUCTION 

Recent advances in genome analysis, including microarray 
and massively parallel sequencing, have shown that a much 
larger portion of the human genome is pervasively transcribed 
into RNA than previously recognized. Moreover, much of the 
evidence emerging in recent years has highlighted the bio- 
logical and pathological importance of RNA molecules that 
lack protein-coding potential; these are collectively referred to 
as noncoding RNAs (ncRNAs) (1). Long ncRNAs (IncRNAs) are 
broadly defined as transcribed RNA molecules greater than 
200 nt in length and lacking an open reading frame of sig- 
nificant length (less than 100 amino acids). Although there are 
no specific definitions, IncRNAs can be categorized into sev- 
eral subgroups based on their locations and characteristics. For 
instance, antisense RNAs are transcribed from the opposite 
strand of a protein-coding gene, while IncRNAs transcribed 
from intergenic regions are referred to as large intergenic 
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ncRNAs (lincRNAs). 

Much about the molecular and biological characteristics of 
IncRNAs is as yet unknown, but what is known suggests that 
many show expression patterns that are spatially and tempo- 
rally specific and are generally poorly conserved among 
species. Cabili et a/, recently used integrated RNA-seq data to 
construct a reference catalog of 8195 human lincRNAs (2). 
According to their reports, lincRNAs, like protein-coding tran- 
scripts, are transcribed by RNA polymerase II and are spliced 
and polyadenylated, but the maximum expression levels of 
lincRNAs are 10 times lower than those of protein-coding 
transcripts. lincRNAs are also smaller in size than protein-cod- 
ing RNAs and have fewer exons: on average, lincRNAs are ~1 
kb in length with 2.9 exons, whereas protein-coding transcripts 
are ~2.9 kb in length with 10.7 exons. In addition, lincRNAs 
are alternatively spliced more frequently than protein-coding 
mRNAs (2.3 isoforms per lincRNA locus, on average). It is also 
noteworthy that whereas 78% of examined lincRNAs exhibit 
tissue-specific expression patterns, only 19% of protein-coding 
transcripts do so. In fact, the tissue (cell or context) specificity 
of lincRNAs expression has been reported often enough (3, 4) 
to suggest that it is an important feature of lincRNAs. 

Although the specific functions of the large majority of 
IncRNAs remain unknown, recent studies have begun to shed 
light on the critical roles played by these molecules in a varie- 
ty of cellular processes, including differentiation, development 
and tumorigenesis. In this review, we will briefly outline the 
known functions of IncRNAs and their involvement in cancer. 

FUNCTIONS OF IncRNA 

Although, as mentioned, the function of most IncRNAs re- 
mains unknown, dozens of examples of biologically functional 
IncRNAs have already been reported, and the number of such 
examples is rapidly increasing (Table 1). Basically, IncRNAs 
appear to be involved in all aspects of gene regulation, includ- 
ing chromosome dosage-compensation, imprinting, epigenetic 
regulation, nuclear and cytoplasmic trafficking, transcription, 
mRNA splicing and translation. Through such gene regulation, 
IncRNAs are involved in a wide range of biological processes, 
including proliferation, cell cycle, apoptosis, differentiation 
and maintenance of pluripotency, among others. There is a 
good review of IncRNA's functions, focusing on biological and 
pathological processes in cancer (5); refer to that article for fur- 
ther details. On the other hand, it is also useful to look at 
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Table 1. Representative IncRNAs implicated in cancer 



Name 


Size 
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1 V U 1 ill 1 Ul I^LIL/I 1 


l\CICI CI 1 LCj 


Oncogenic 












HOTAIR 


2.2 kb 


Breast, hepatocellular, 


Promotes invasion and metastasis, 


Scaffold (PRC2, LSD1), 


(11, 26-29) 






colorectal, pancreatic, GIST 


Modulates cancer epigenome 


Guide {in trans) 




ANRIL 


2.2 Kb 


Prostate, leukemia, 


Suppresses senescence via INK4A 


Scaffold (PKL1 , PRC.2), 


(30-35) 






neural system, melanoma 




Guide {in cis) 




K A A 1 AT1 

MALA I I 


0./ kd 


Lung, prostate, breast, 


Regulates alternative splicing of 


Scaffold (nuclear paraspeckle) 


(J6-4U, 4/) 






colon, hepatocellular 


pre-mRNA 






PCAT-1 


1 .9 kb 


Prostate 


Promotes cell proliferation, 


Unknown 


(42) 








inhibits BRCA2 






rLUbM 1 


1 (; l,k 
I .b Kb 


Prostate 


Inhibits apoptosis, promotes cell 


Unknown 


MQ /10\ 

(48, 49) 








proliferation 






1 ULJio 


0.6 kb 


Hepatocellular 


Promotes cell proliferation 


Unknown 


(50) 


uc.73a 


0.2 kb 


Leukemia, colorectal 


Inhibits apoptosis, promotes cell 


Unknown 


(51) 








proliferation 






SPRY4-IT1 


0.7 kb 


Melanoma 


Promotes cell proliferation and 


Unknown 


(52) 








invasion, Inhibits apoptosis 






ncRAN 


2.3 kb 


Neuroblastoma, bladder 


Promotes cell proliferation and 


Unknown 


(53, 54) 








invasion 






PRNCR1 


13 kb 


Prostate 


Promotes cell proliferation 


Unknown 


(55) 


H19 


2.3 kb 


Breast, hepatocellular 


Promotes cell proliferation, both 


Unknown 


(56, 57) 








oncogenic and tumor suppressive 












functions reported 






Tumor suppressive 












GAS5 


0.6-1 .8 kb 


Breast 


Induces growth arrest and 


Decoy 


(21,44) 








apoptosis 


(glucocorticoid receptor) 




MEG3 


1.6-1.8 kb 


Meningioma, hepatocellular, 


Mediates p53 signaling 


Unknown 


(58, 59) 






leukemia, pituitary 


Inhibits cell proliferation 






PTENP1 


3.9 kb 


Prostate, colon 


Inhibits cell growth 


Decoy 


(23) 










(PTEN-suppressing miRNAs) 




LincRNA-p21 


3.1 kb 


Mouse models of lung, 


Induces apoptosis by repressing 


Guide (hnRNP-k, in trans) 


(10) 






sarcoma, lymphoma 


p53 targets 






Unknown, Biomarker 












HULC 


0.5 kb 


Hepatocellular 


Unknown 


Decoy (miR-372) 


(60, 61) 


PCA3 


3.6 kb 


Prostate 


Unknown 


Unknown 


(62) 


PANDA 


1.5 kb 


Unknown 


Inhibits apoptosis by sequestering 


Decoy (NF-YA) 


(4) 



NF-YA 



IncRNAs in the context of their molecular functionality (6). In 
this review we will focus on three molecular functions of 
IncRNAs: guide, scaffold and decoy. 

Guide 

One group of IncRNAs is able to bind specific proteins and 
then direct the localization of the resultant complex to specific 
targets. Such IncRNAs can guide proteins either in cis (on 
neighboring genes) or in trans (on distantly located genes). 
XIST, one of the most well-studied IncRNAs, guides PRC2 
(polycomb repressive complex 2) to one of the two X chromo- 
somes in cis to achieve X inactivation (7). Two other examples 
of IncRNAs that function as guides in cis are AIR and HOTTIP. 
AIR silences transcription of its target gene on the paternal 
chromosome by recruiting G9a and then mediating targeted 
histone H3 lysine 9 (H3K9) methylation and allelic silencing 
(8). HOTTIP, which is transcribed from the end of HOXA clus- 
ter, binds to WDR5 and recruits the MLL histone H3 lysine 4 
(H3K4) methyltransferase complex to the HOXA cluster to sup- 



port active chromatin confirmation (9). 

Some IncRNAs, including lincRNA-p21 and HOTAIR, are 
able to alter and regulate epigenetic states and gene ex- 
pression across multiple sites in trans (10, 1 1). LincRNA-p21 is 
transcribed upstream of CDKN1 A gene and acts as a transcrip- 
tional repressor through its interaction with hnRNP-K, which it 
guides to target sites. Knocking down lincRNA-p21 alters the 
expression of over 1,000 genes, suggesting lincRNAs may be 
able to regulate numerous genes in trans. In addition, some 
IncRNAs may serve as cellular "navigation systems" for pro- 
teins lacking direct DNA binding capacity (12). 

The mechanism by which IncRNAs specifically regulate 
their target genes remains unclear, and their binding sites 
throughout the genome are largely unknown. In one recent 
study, Chu and colleagues addressed this question using a 
novel assay they named ChIRP (Chromatin Isolation by RNA 
Purification)-seq, which is a method for genome-wide mapping 
of IncRNA binding sites in vivo (13). This analysis enabled 
them to obtain a high-resolution map of ncRNA occupancy 
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throughout the genome and to identify a set of 832 HOTAIR 
binding sites in human breast cancer cells. Interestingly, bind- 
ing sites for HOTAIR are focal (<500 bp) and located in the 
midst of a broad polycomb binding domain, which suggests 
HOTAIR may act as a pioneering factor able to recruit poly- 
comb to its target genes and then bilaterally spread the re- 
pressive regions outward. They also discovered an underlying 
DNA sequence motif enriched in HOTAIR binding sites, in- 
dicating the existence of a new class of regulatory element: 
IncRNA target sites. We now think that IncRNAs can function 
as sequence-specific transcription factors. 

Scaffold 

Another class of IncRNAs may possess distinct domains that 
bind different effector molecules. Such IncRNAs can mediate 
assembly of multiple molecular components in temporally and 
spatially specific manner. Two examples of IncRNAs that act 
as scaffolds are ANRIL, which interacts with components from 
PRC1 (polycomb repressive complex 1) and PRC2 (14, 15), 
and KCNQJOT1, which binds both G9a and PRC2 (16). In ad- 
dition, MALAT1 and NEAT1 serve as molecular scaffolds for 
proteins within nuclear speckles and paraspeckles, re- 
spectively (1 7). Depletion of NEAT1 is sufficient to cause loss 
of paraspeckles from within the nucleus, and overexpressing 
NEAT1, but not paraspeckle-associated proteins, leads to an in- 
crease in the number of paraspeckles, suggesting NEAT1 plays 
an essential role as a scaffold in the formation of paraspeckles. 

A recent study revealed that HOTAIR interacts with two 
chromatin modifying complexes, the PRC2 complex ("writer" 
of a repressive mark, H3K27 trimethylation) and the 
LSD1/CoREST H3K4 demethylase complex ("eraser" of an acti- 
vating mark, H3K4 trimethylation) (18). Using a series of dele- 
tion mutants, the PRC2 binding domain was mapped to the 5' 
end (the first 300 nt) of HOTAIR, while the LSD1 binding site 
corresponds to the 3' end. This suggests that HOTAIR that 
bridges between the PRC2 and LSD1 complexes, and that the 
resultant HOTAIR/PRC2/LSD1 complex can suppress gene ex- 
pression via multiple mechanisms. 

This finding is not applicable only to HOTAIR; many other 
IncRNAs also appear to interact with both the PRC2 and LSD1 
complexes. For example, Khalil and colleagues performed 
RIP-chip assays (RNA coimmunoprecipitation combined with 
high throughput lincRNA microarray) using antibodies di- 
rected against several proteins involved in chromatin modify- 
ing complexes (PRC2, CoREST and SMCX) (19). They found 
that as many as 38% of lincRNAs expressed in the cell types 
studied reproducibly associate with one of these complexes. In 
mouse embryonic stem cells, moreover, a number of lincRNAs 
were found to be strongly associated with multiple chromatin 
modifier complexes (20). For example, eight lincRNAs bind to 
the PRC2 H3K27 and ESET H3K9 methyltransferase complexes 
and the JARID1C H3K4 demethylase complex. Similarly, 17 
lincRNAs were found to bind to the PRC2, PRC1 and JARID1 B 
complexes. Taken together, these results suggest the attractive 



hypothesis that lincRNAs bind to ubiquitously expressed chro- 
matin modifying complexes in order to guide them to specific 
genomic regions. 

Decoy 

A third class of IncRNAs bind and then sequester a protein or 
RNA target, but do not exert additional effects. By acting as mo- 
lecular decoys, these IncRNAs negatively regulate the expression 
of their targets. Examples of IncRNAs with decoy functionality in- 
clude GAS5 (growth arrest-specific 5), which binds to the gluco- 
corticoid receptor (GR) and represses GR-induced genes (21); 
PANDA (P21 associated ncRNA DNA damage activated), which 
binds to the transcription factor NF-YA to negatively regulate ex- 
pression of pro-apoptotic genes (4); and TERRA (telomeric re- 
peat-containing RNA), which interacts with telomerase via a re- 
peat sequence to reduce enzyme activity (22). 

Interestingly, several pseudogenes also reportedly act as mo- 
lecular decoys. The 3'UTR of PTENP1, a tumor suppressor 
pseudogene, was found to bind the same set of regulatory 
miRNA sequences that normally target the tumor-suppressor 
gene PTEN, which reduces the downregulation of PTEN 
mRNA and allows its translation into the tumor-suppressor 
protein PTEN (23). It was also shown that the PTENP1 locus is 
selectively lost in human cancer, and that similar relationships 
also exist between other cancer-related genes and their pseu- 
dogenes (24). These findings attribute a novel biological func- 
tion to expressed pseudogenes, as they can regulate coding 
gene expression and reveal a non-coding function in mRNAs. 

DYSREGULATION OF IncRNAS IN HUMAN CANCER 
HOTAIR 

The lincRNA HOTAIR was originally discovered by Rinn and 
colleagues (25). It is encoded within the HOXC gene cluster 
and acts in trans to regulate HOXD genes through recruitment 
of PRC2 to induce trimethylation of H3K27 (H3K27me3). 
Remarkably, pull-down assays with PRC2 components demon- 
strated a direct and specific interaction with HOTAIR. This ob- 
servation that HOTAIR binds PRC2 and induces epigenetic si- 
lencing of another HOX cluster on a different chromosome 
was an unexpected and novel finding. 

In breast cancer, elevated expression of HOTAIR reportedly 
correlates with a poor prognosis and tumor metastasis (11). It 
is noteworthy that expression of a single lincRNA in primary 
tumors can be a powerful predictor of eventual metastasis and 
death. Enforced expression of HOTAIR induces genome-wide 
re-targeting of PRC2, leading to altered H3K27me3 and gene 
expression, and increased cancer invasiveness and metastasis. 
The link between HOTAIR and metastatic disease depends on 
both the direct interaction between the ncRNA and its protein 
partner and between the ncRNA and its target DNA sequence. 
Furthermore, several other studies have also shown that the 
level of HOTAIR expression correlates positively with meta- 
stasis and poor outcome in hepatocellular carcinoma, color- 
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ectal cancer and pancreatic cancer (26-28). We also dis- 
covered that upregulation of HOTAIR is strongly associated 
with aggressiveness in gastrointestinal stromal tumors (GISTs) 
(29). Interestingly, in malignant GISTs, HOTAIR is con- 
currently overexpressed with collinear HOXC genes and an 
oncogenic microRNA, miR-196a. We also observed enrich- 
ment of an active histone mark, H3K4me3, over a wide range 
of the HOXC cluster, suggesting the entire region is epigeneti- 
cally activated in malignant GISTs. Taken together, these re- 
sults suggest that lincRNAs play active roles in modulating the 
cancer epigenome and could be useful targets for cancer diag- 
nosis and therapy. 

ANRIL 

ANRIL (antisense IncRNA of the INK4 locus) is transcribed an- 
tisense to The INK4 locus and mediates INK4a transcriptional 
repression in cis (30). Independent studies have shown that 
overexpression of ANRIL in prostate cancer results in the si- 
lencing of INK4b/ARF/INK4a and p15/CDKN2B due to hetero- 
chromatin formation (14, 31). ANRIL interacts with SUZ12 
(suppressor of zeste 12 homolog), a subunit of PRC2 (15) and 
with CBX7 (chromo- box homolog 7), a subunit of PRC1 (14). 
ANRIL may recruit multiple sets of chromatin-modifying com- 
plexes to a target gene for silencing, serving as a molecular 
scaffold. Genome-wide association studies (GWAS) have 
linked ANRIL with increased susceptibility to coronary dis- 
ease, intracranial aneurysm and type 2 diabetes, as well as to 
several types of cancer, including acute lymphoblastic leuke- 
mia, glioma, basal cell carcinoma, nasopharyngeal carcinoma, 
breast cancer and plexiform neurofibromas (32-35). 

MALAT1 

MALAT1 (Metastasis-Associated Lung Adenocarcinoma Tran- 
script 1) was initially identified in a screen for genes associated 
with metastasis (36). It is abundantly expressed in many hu- 
man cell types and is highly conserved across several species. 
MALAT1 is reportedly upregulated in multiple malignancies, 
including lung cancer, uterine endometrial stromal sarcoma, 
cervical cancer and hepatocellular carcinoma (36-38). In lung 
metastasizing tumors, MALAT1 expression is three-fold higher 
than in non-metastasizing tumors (36), and it can serve as an 
independent prognostic parameter for patient survival in early 
stage lung adenocarcinoma (39). MALAT1 promotes the mo- 
tility of lung cancer cells through transcriptional or post-tran- 
scriptional regulation of motility-related genes (40). 

MALAT1 localizes to nuclear speckles, which contain sev- 
eral proteins known to be involved in alternative splicing (41). 
It appears that MALAT1 forms a molecular scaffold for several 
of the proteins present within nuclear speckles, and modulates 
the phosphorylation of SR proteins. Depletion of MALAT1 is 
sufficient to alter the patterns of alternative splicing of a subset 
of mRNAs. 



LincRNA-p21 

Recent studies identified a number of lincRNAs induced by 
the p53 tumor-suppressor gene (10). LincRNA-p21 is located 
upstream of the CDKN1 A gene on mouse chromosome 1 7 and 
is directly activated by p53 in response to DNA damage. 
LincRNA-p21 acts as a transcriptional repressor in the canon- 
ical p53 pathway and plays a role in triggering apoptosis. 
Inhibition of lincRNA-p21 alters the expression of hundreds of 
genes normally repressed by p53, potentially explaining how 
p53 can activate large numbers of genes while simultaneously 
repressing many others. LincRNA-p21 also interacts with heter- 
ogeneous nuclear ribonucleoprotein K (hnRNP-K), a well- 
known RNA binding protein that acts as a transcriptional 
repressor. A 780-nt region at the 5' end of lincRNA-p21 is nec- 
essary for interaction with hnRNP-K, which is required for 
proper genomic localization of hnRNP-K at repressed genes 
and for regulation of p53-mediated apoptosis. Although 
lincRNA-p21 has not been directly associated with disease, we 
would speculate that loss of lincRNA-p21 function could be an 
important factor contributing to cancer initiation. 

PANDA 

PANDA is also induced in a p53-dependent manner. After 
DNA damage, p53 directly binds to the CDKN1A locus to acti- 
vate PANDA (4), which appears to possess decoy function. 
PANDA inhibits the expression of apoptotic genes by directly 
binding to and sequestering the transcription factor NF-YA 
away from target gene promoters. PANDA is overexpressed in 
a subset of human breast cancers, and its depletion can sensi- 
tize cells to chemotherapeutic agents. 

PCAT-1 

Prensner and colleagues used high throughput RNA-Seq with a 
large panel of clinical samples to comprehensively evaluate 
the ncRNAs dysregulated in prostate cancer (42). They identi- 
fied approximately 1,800 lincRNAs in prostate tissue, of which 
121 were transcriptionally dysregulated in prostate cancer. 
Among them, PCAT-1 (prostate cancer associated transcript -1 ) 
showed tissue-specific expression and was selectively upregu- 
lated in prostate cancer. Like HOTAIR, PCAT-1 functions pre- 
dominantly as a transcriptional repressor by facilitating 
trans-regulation of genes preferentially involved in mitosis and 
cell division, including known tumor suppressor genes such as 
BRCA2. The discovery of PCAT-1 highlights the usefulness of 
unbiased transcriptome analysis when investigating the actions 
of IncRNAs in cancer. 

GAS5 

GAS5 was originally identified as a gene highly expressed in 
cells whose growth was arrested (43). The human GAS5 gene 
is transcribed from chromosome 1q25.1 and is alternatively 
spliced. GAS5 sensitizes cells to apoptosis by regulating the 
activity of glucocorticoids in response to nutrient starvation 
(21). GAS5 binds to the DNA-binding domain of the GR, 
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where it acts as a decoy preventing the GR from interacting 
with cognate glucocorticoid response elements (GRE). The 
binding of GAS5 to the GR is sufficient to repress GR-induced 
genes such as clAP2. 

Implicating GAS5 in breast cancer is the observation that lev- 
els of GAS5 transcript are significantly lower in breast cancer 
cells than in unaffected normal breast epithelium (44). In addi- 
tion, genetic aberrations at the GAS5 locus have been found in 
many types of tumors, including melanomas and breast and 
prostate cancers (45, 46), though their functional significance 
has not yet been established. The GAS5 gene locus has also 
been linked to increased susceptibility to autoimmune dis- 
orders such as systemic lupus erythematosus in the mouse 
BXSB strain (21). Chromosomal translocations affecting the 
1q25 locus containing the GAS5 gene have been detected in 
melanoma, B-cell lymphoma, and prostate and breast cancer. 

CONCLUDING REMARKS 

In recent years, technological advances in high-throughput se- 
quencing have enabled us to identify numerous ncRNAs, 
which has improved our appreciation for the complexity of 
transcriptome and enabled us to demonstrate the dysregulated 
expression of a number of IncRNAs in various types of cancer. 
The affected molecules are now thought to function as onco- 
genes or tumor suppressors. However, the biological and mo- 
lecular characteristics of most IncRNAs remain unknown, and 
much effort will be needed before a full understanding of their 
roles in normal and cancer cells is attained. This work will in 
part entail bioinformatics analysis, and secondary structure 
prediction will be important for identifying potentially func- 
tional motifs in IncRNAs. Elucidation of the structures of genes 
encoding IncRNAs, including the promoters, transcription start 
sites and enhancers will be essential for understanding the 
mechanisms governing their spatiotemporal expression 
patterns. Moreover, greater knowledge of the gene structures 
will lead to complete cloning of IncRNAs, which will facilitate 
functional studies using expression constructs and mouse 
models. Analysis of genetic and epigenetic alterations of 
IncRNA genes will also provide clues to understanding their 
pathological roles, and the mechanisms by which such alter- 
ations affect IncRNA function must be carefully characterized. 
Although the roles played by IncRNAs in cancer have just be- 
gun to be revealed, it is anticipated that advances in the study 
of IncRNAs will yield new diagnostic biomarkers as well as 
RNA-based therapeutic strategies. 
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